A data mining framework for detecting subscription fraud in telecommunication
Abstract
Service providers, including telecommunication companies, often suffer substantial losses from customers' fraudulent behavior. One common type is subscription fraud, in which the usage type contradicts the subscription type. This study aimed to identify customers' subscription fraud by employing data mining techniques and adopting the knowledge discovery process. To this end, a hybrid approach consisting of preprocessing, clustering, and classification phases was applied, with tools appropriate to each phase. Specifically, in the clustering phase SOM and K-means were combined, and in the classification phase decision tree (C4.5), neural networks, and support vector machines were examined as single classifiers, and bagging, boosting, stacking, majority voting, and consensus voting as ensembles. In addition to using clustering to identify outlier cases, it was also possible, by defining new features, to carry the results of the clustering phase into the classification phase, which in turn contributed to better classification results. A real dataset provided by the Telecommunication Company of Tehran was used to demonstrate the effectiveness of the proposed method. The efficient use of synergy among these techniques significantly increased prediction accuracy. The performance of all single and ensemble classifiers was evaluated on various metrics and compared using statistical tests. The results showed that support vector machines among single classifiers, and boosted trees among all classifiers, performed best across the metrics. The findings show that the proposed model has high accuracy and that the outcomes are significant both theoretically and practically. © 2010 Elsevier Ltd. All rights reserved.
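The idea of carrying clustering results into the classification phase as new features can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the SOM stage is omitted and plain k-means is used alone, the data are toy two-dimensional points rather than the Tehran dataset, and the function names (`kmeans`, `add_cluster_features`, `majority_vote`) are hypothetical. It is not the authors' implementation.

```python
import math
from collections import Counter

def nearest(p, centroids):
    """Index of the centroid closest to point p (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))

def kmeans(points, k, iters=20):
    """Plain k-means; the first k points serve as initial centroids (assumption)."""
    centroids = [list(p) for p in points[:k]]
    for _ in range(iters):
        # Assignment step: label each point with its nearest centroid.
        labels = [nearest(p, centroids) for p in points]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    # Final assignment so labels match the returned centroids.
    return centroids, [nearest(p, centroids) for p in points]

def add_cluster_features(points, centroids, labels):
    """Append cluster id and distance to own centroid as two new features."""
    return [list(p) + [l, math.dist(p, centroids[l])]
            for p, l in zip(points, labels)]

def majority_vote(predictions):
    """Combine the labels several classifiers gave one record."""
    return Counter(predictions).most_common(1)[0][0]

# Toy records with two hypothetical usage features each.
points = [(1.0, 1.0), (8.0, 8.0), (1.2, 0.8), (8.2, 7.9), (0.9, 1.1), (7.9, 8.1)]
centroids, labels = kmeans(points, k=2)
rows = add_cluster_features(points, centroids, labels)
vote = majority_vote(["fraud", "normal", "fraud"])
```

Each row in `rows` now holds the original features plus a cluster id and a distance to the cluster centroid; a large distance could serve as a simple outlier signal, and the augmented rows would be the input to whichever classifiers are combined by `majority_vote`.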
Similar resources
Data Mining Approach For Subscription-Fraud Detection in Telecommunication Sector
This paper implements a probability-based method for fraud detection in the telecommunication sector. We used Naïve-Bayesian classification to calculate the probability and an adapted version of KL-divergence to identify fraudulent customers on the basis of subscription. Each user's data corresponds to one record in the database. Since the data involves continuous numerical values, the NaïveBa...
Detecting Telecommunication Fraud using Neural Networks through Data Mining
Neural computing refers to a pattern recognition methodology for machine learning. The resulting model from neural computing is often called an artificial neural network (ANN) or a neural network. Neural networks have been used in many business applications for pattern recognition, forecasting, prediction and classification. Neural network computing is a key component for any data mining too...
Presenting a framework for detecting fraud risk factors affecting fraud occurrence in banks (Case study: Resalat Banks in Isfahan, Iran)
The present study aimed to investigate fraud risk factors affecting fraud occurrence in the branches of Resalat Bank in Isfahan, Iran, in 2017. The study is an applied research as far as the purpose is concerned, and a descriptive survey study as far as the procedures for data collection are concerned. The population of the study comprised experts in accounting computer information system, expe...
Data mining techniques for Fraud Detection
Due to the dramatic increase in fraud, which results in the loss of billions of dollars worldwide each year, several modern techniques for detecting fraud are continually being developed and applied to many business fields. Fraud detection involves monitoring the behaviour of populations of users in order to estimate, detect, or avoid undesirable behaviour. Undesirable behaviour is a broad term including mis...
Combination of Ensemble Data Mining Methods for Detecting Credit Card Fraud Transactions
As we know, credit cards speed up and make life easier for all citizens and bank customers. They can use it anytime and anyplace according to their personal needs, instantly and quickly and without hassle, without worrying about carrying a lot of cash and more security than having liquidity. Together, these factors make credit cards one of the most popular forms of online banking. This has led ...
Journal: Eng. Appl. of AI
Volume: 24
Publication date: 2011